BIOTECHNOLOGY <gsD > ^ 




RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: lOM °1 ~l J S ^ ^"F V 
Source: X^a/o , 

Date Processed by STIC: j C>—\ j[-OV 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.2 PROGRAM ACCESSIBLE THROUGH THE U^S PATENT AND 
TRADEMARK OFFICE WEBSITE SEE BELOW FOR ADDRESS 
http://vvww.uspto.gov/wcb/ofnces/pac/chccker/chkrnotc.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 
a possibility that the disk/CD-Rom may have been afTccled by treatment given to all incoming mail 
Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom 
Any re ply including a sequence listing in electronic form should NOT be sent to the 2023 I zip code address for the 
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses 

1 EFS-Bio (<h((n://www.uspto.eov/cbc/cfs/downloads/documcnts.h(m> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 06/05/04): 
U.S. Patent and Trademark Office, 220 20°' Street S., Customer Window, Mail Stop Sequence, Crystal Plaza Two Lobby 
Room 1B03, Arlington, VA 22202 ~ 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/7 97 , 553A 



DATE: 10/18/2004 
TIME: 14:23:13 



Input Set : A:\797, 553SeqList. txt 

Output Set: N:\CRF4\10182004\J797553A.raw 



3 <110> APPLICANT: Moyle, William R. 

4 Xing, Yonga 

6 <12 0> TITLE OF INVENTION: Protein Knobs 
8 <130> FILE REFERENCE: Moyl-005US 

10 < 14 0> CURRENT APPLICATION NUMBER: 10/797, 553A 

11 <141> CURRENT FILING DATE: 2004-03-10 

13 <150> PRIOR APPLICATION NUMBER: PCT/US02/35914 

14 <151> PRIOR FILING DATE: 2002-11-08 
16 <160> NUMBER OF SEQ ID NOS : 66 

18 <170> SOFTWARE: Patentln version 3.3 . 

20 <210> SEQ ID NO: 1 OVi A'A'] 

21 <211> LENGTH: 92 ^J>\ £ 

22 <212> TYPE: PRT- f T 1 f^J^^ 

23 <213> ORGANISl^ Artificial ^-^-^<C~^^ ^ 
2 5 <22 0> FEATURE :V .^^^^ 
26 <223> OTHER INFORMA¥*OTTT^rt if icial Sequence 
2 8 <40 0> SEQUENCE: 1 V 



°t c 




3 0 Ala Pro Asp Val Gin Asp Cys Pro Glu Cys Thr Leu Gin Glu Asn Pro 
31 1 5 10 15 

34 Phe Phe Ser Gin Pro Gly Ala Pro lie Leu Gin Cys Met Gly Cys Cys 

35 20 25 30 
3 8 Phe Ser Arg Ala Tyr Pro Thr Pro Leu Arg Ser Lys Lys Thr Met Leu 
39 35 40 45 

42 Val Gin Lys Asn Val Thr Ser Glu Ser Thr Cys Cys Val Ala Lys Ser 

43 50 55 60 

46 Tyr Asn Arg Val Thr Val Met Gly Gly Phe Lys Val Glu Asn His Thr 

47 65 70 75 80 
5 0 Ala Cys His Cys Ser Thr Cys Tyr Tyr His Lys Ser 
51 85 90 

54 <210> SEQ ID NO: 2 

55 <211> LENGTH: 92 

56 <212> TYPE: PRT 

57 <213> ORGANISM 
5 9 <220> FEATURE 

60 <223> OTHER INF75RMftTl 0fr : Artificial Sequence - ^ , 

62 <4 00> SEQUENCE: 2 — {cPSy^ 

64 Ala Pro Asp Val Cys Asp Cys Pro Glu Cys Thr Leu Gin Glu Asn Pro 

6 5 1 5 10 15 

68 Phe Phe Ser Gin Pro Gly Ala Pro He Leu Gin Cys Met Gly Cys Cys 

69 20 25 30 

72 Phe Ser Arg Ala Tyr Pro Thr Pro Leu Arg Ser Lys Lys Thr Met Leu 

73 35 40 " 45 

76 Val Gin Lys Asn Val Thr Ser Glu Ser Thr Cys Cys Val Ala Lys Ser 
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RAW SEQUENCE LISTING DATE: 10/18/2004 

PATENT APPLICATION: US/10/7 97 , 553A TIME: 14:23:13 



Input Set : A:\797, 553SeqList.txt 
Output Set: N:\CRF4\10182004\J797553A.raw 

77 50 55 60 

8 0 Tyr Asn Arg Val Thr Val Met Gly Gly Phe Lys Val Glu Asn His Thr 

81 65 70 75 80 

84 Ala Cys His Cys Ser Thr Cys Tyr Tyr His Lys Ser 



85 
88 
89 
90 



<210> SEQ ID NO 



85 
3 



90 




<211> LENGTH: 92 
<212> TYPE: PRT 
91 <213> ORGANISM: Artificial ; 

93 <22 0> FEATURE: 

94 <223> OTHER INFORMATION artificial sequence 




96 <400> SEQUENCE: 3 

98 Ala Pro Asp Val Gin Asp Cys" Pro Glu Cys Thr Cys Gin Glu Asn Pro 

99 1 5 10 ' 15 

102 Phe Phe Ser Gin Pro Gly Ala Pro lie Leu Gin Cys Met Gly Cys Cys 

103 20 25 30 

106 Phe Ser Arg Ala Tyr Pro Thr Pro Leu Arg Ser Lys Lys Thr Met Leu 

107 35 40 45 

110 Val Gin Lys Asn Val Thr Ser Glu Ser Thr Cys Cys Val Ala Lys Ser 

111 50 55 60 

114 Tyr Asn Arg Val Thr Val Met Gly Gly Phe Lys Val Glu Asn His Thr 

115 65 70 75 80 
118 Ala Cys His Cys Ser Thr Cys Tyr Tyr His Lys Ser 



119 
122 



85 



90 



<210> SEQ ID NO: 4 

123 <211> LENGTH: 92 

124 <212> TYPE: PRT 

125 <213> ORGANISM: Artificial 
12 7 <22 0> FEATURE: 

128 <223> OTHER INFORMATION: Artificial sequence 
130 <400> SEQUENCE: 4 f ^ 

132 Ala Pro Asp Val Gin AsV-Ty^i Pro Glu eys^TEr^Leu Gin Glu Cys Pro 

133 15 10 15 

136 Phe Phe Ser Gin Pro Gly Ala Pro lie Leu Gin Cys Met Gly Cys Cys 

137 20 25 * 30 

140 Phe Ser Arg Ala Tyr Pro Thr Pro Leu Arg Ser Lys Lys Thr Met Leu 

141 35 40 ~ 45 

144 Val Gin Lys Asn Val Thr Ser Glu Ser Thr Cys Cys Val Ala Lys Ser 

145 50 55 60 

14 8 Tyr Asn Arg Val Thr Val Met Gly Gly Phe Lys Val Glu Asn His Thr 
149 65 70 75 80 

152 Ala Cys His Cys Ser Thr Cys Tyr Tyr His Lys Ser 



153 
156 



85 



90 



<210> SEQ ID NO: 5 
157 <211> LENGTH: 92 
15 8 <212> TYPE: PRT 
159 <213> ORGANISM: Artifi 

161 <220> FEATURE: 

162 <223> OTHER INFORMATION 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/7 97 , 553A 



DATE: 10/18/2004 
TIME: 14:23:13 



Input Set : A:\797, 553SeqList . txt 

Output Set: N:\CRF4\10l82004\J797553A.raw 



164 <400> SEQUENCE: 5 



166 


Ala 


Pro 


Asp 


Val 


Gin 


Asp 


Cys 


Pro 


Glu 


Cys 


Thr 


Leu 


Gin 


Glu 


Asn 


Pro 


167 


1 








5 










10 










15 




170 


Cys 


Phe 


Ser 


Gin 


Pro 


Gly 


Ala 


Pro 


He 


Leu 


Gin 


Cys 


Met 


Gly 


Cys 


Cys 


171 








20 










25 










30 




174 


Phe 


Ser 


Arg 


Ala 


Tyr 


Pro 


Thr 


Pro 


Leu 


Arg 


Ser 


Lys 


Lys 


Thr 


Met 


Leu 


175 






35 










40 










45 








178 


Val 


Gin 


Lys 


Asn 


Val 


Thr 


Ser 


Glu 


Ser 


Thr 


Cys 


Cys 


Val 


Ala 


Lys 


Ser 


179 




50 










55 










60 








182 


Tyr Asn 


Arg 


Val 


Thr 


Val 


Met 


Gly 


Gly 


Phe 


Lys 


Val 


Glu 


Asn 


His 


Thr 


183 


65 










70 










75 










80 


186 


Ala 


Cys 


His 


Cys 


Ser 


Thr 


Cys 


Tyr 


Tyr 


His 


Lys 


Ser 










187 










85 










90 
















190 
191 
192 
193 
195 
196 
198 
200 
201 
204 
205 
208 
209 
212 
213 
216 
217 
220 
221 
224 
225 
226 
227 
229 
230 
232 
234 
235 
238 
239 
242 
243 
246 
247 
250 
251 




<210> SEQ ID NO: 6 
<211> LENGTH: 92 
<212> TYPE: PRT 
<213> ORGANISM: Artificial 
<220> FEATURE: 
<223> OTHER INFORMATI 
<400> SEQUENCE: 6 
Ala Pro Asp Val Gin 



Cys Pro Glu CysJThr Leu Gin Glu Asn Pro 

15 

Phe Phe Ser Gin Pro Cys Ala Pro He Leu Gin Cys Met Gly Cys Cys 

20 25 " 30 

Phe Ser Arg Ala Tyr Pro Thr Pro Leu Arg Ser Lys Lys Thr Met Leu 

35 40 45 

Val Gin Lys Asn Val Thr Ser Glu Ser Thr Cys Cys Val Ala Lys Ser 

50 55 60 

Tyr Asn Arg Val Thr Val Met Gly Gly Phe Lys Val Glu Asn His Thr 
65 70 75 80 

Ala Cys His Cys Ser Thr Cys Tyr Tyr His Lys Ser 

85 90 
<210> SEQ ID NO: 7 
<211> LENGTH: 92 
<212> TYPE: PRT 

<213> ORGANISM: Artificial 

<22 0> FEATURE: 

<223> OTHER INFORMATION^/art if icial sequence 
<4 00> SEQUENCE: 7 
Ala Pro Asp Val Gin Asp^y*i Pro Clu Cys Thr Letr^ln Glu Asn Pro 
15 10 15 

Phe Phe Ser Gin Pro Gly Ala Pro He Leu Cys Cys Met Gly Cys Cys 

20 25 30 

Phe Ser Arg Ala Tyr Pro Thr Pro Leu Arg Ser Lys Lys Thr Met Leu 

35 40 45 

Val Gin Lys Asn Val Thr Ser Glu Ser Thr Cys Cys Val Ala Lys Ser 

50 55 60 

Tyr Asn Arg Val Thr Val Met Gly Gly Phe Lys Val Glu Asn His Thr 
65 70 75 80 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/7 97 , 553A 



DATE: 10/18/2004 
TIME: 14:23:13 



Input Set : A:\797, 553SeqList.txt 

Output Set: N:\CRF4\10182004\J797553A.raw 




2 54 Ala Cys His Cys Ser Thr Cys Tyr Tyr 
255 85 

258 <210> SEQ ID NO: 8 

259 <211> LENGTH: 92 

260 <212> TYPE: PRT 

261 <213> ORGANISM: Artificial 

263 <220> FEATURE: 

264 <223> OTHER INFORMATIO: 
266 <400> SEQUENCE: 8 

268 Ala Pro Asp Val Gin As 

269 1 5 

272 Phe Phe Ser Gin Pro Gly Ala Pro He 

273 20 25 

2 76 Phe Ser Arg Ala Tyr Pro Thr Pro Leu 
277 35 40 

280 Val Gin Lys Asn Val Thr Ser Glu Ser 

281 50 55 

2 84 Tyr Asn Arg Val Thr Val Met Gly Gly 
285 65 70 
2 88 Ala Cys His Cys Ser Thr Cys Tyr Tyr 

85 

<210> SEQ ID NO: 9 
<211> LENGTH: 92 

294 <212> TYPE: PRT 

295 <213> ORGANISM: Artificial ^ 

297 <220> FEATURE: S*^^ 

2 98 <223> OTHER INFORMATION i^art if icial 
300 <400> SEQUENCE: 9 ( 

3 02 Ala Pro Asp Val Gin Asp (Sy ^Pro Glu 
1 5 ~ 



His Lys Ser 
90 



289 
292 
293 



Cys TJaar Cys Gin Glu Asn Pro 

15 

Leu Gin Cys Met Gly Cys Cys 
30 

Arg Ser Lys Lys Thr Met Leu 
45 

Thr Cys Cys Val Ala Lys Ser 
6 0 

Phe Lys Val Glu Asn His Thr 

75 80 
His Lys Ser 
90 



sequence 




303 
306 
307 



Phe Phe Ser Gin Pro Gly Ala Pro He 
20 25 

310 Phe Ser Cys Ala Tyr Pro Thr Pro Leu 

311 35 40 

314 Val Gin Lys Asn Val Thr Ser Glu Ser 

315 50 55 

318 Tyr Asn Arg Val Thr Val Met Gly Gly 

319 65 70 

322 Ala Cys His Cys Ser Thr Cys Tyr Tyr 



323 



85 



<££S~-*¥fix Leu Gin Glu Asn Pro 
10 15 
Leu Gin Cys Met Gly Cys Cys 
30 

Arg Ser Lys Lys Thr Met Leu 
45 

Thr Cys Cys Val Ala Lys Ser 
60 

Phe Lys Val Glu Asn His Thr 

75 80 
His Lys Ser 
90 



326 <210> SEQ ID NO: 10 

327 <211> LENGTH: 92 

328 <212> TYPE: PRT 

329 <213> ORGANISM: Artificial 

331 <22 0> FEATURE: 

332 <223> OTHER INFORMATION 
334 <400> SEQUENCE: 10 

336 Ala Pro Asp Val Gin As 

337 1 5 




Leu Gin Glu Asn Pro 
15 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/7 97 , 553A 



DATE: 10/18/2004 
TIME: 14:23:13 



Input Set : A:\797, 553SeqList.txt 

Output Set: N:\CRF4\10182004\J797553A.raw 



340 Phe Phe Ser Gin Pro Gly Ala Pro lie 

341 20 25 

344 Phe Ser Arg Ala Cys Pro Thr Pro Leu 

345 35 40 

348 Val Gin Lys Asn Val Thr Ser Glu Ser 

349 50 55 

3 52 Tyr Asn Arg Val Thr Val Met Gly Gly 

353 65 70 

356 Ala Cys His Cys Ser Thr Cys Tyr Tyr 



357 
360 



85 



Leu Gin Cys Met Gly Cys Cys 

30 

Arg Ser Lys Lys Thr Met Leu 
45 

Thr Cys Cys Val Ala Lys Ser 
60 

Phe Lys Val Glu Asn His Thr 

75 80 
His Lys Ser 
90 



11 



363 
365 




<210> SEQ ID NO: 
361 <211> LENGTH: 92 
3 62 <212> TYPE: PRT 

<213> ORGANISM: Artificial 
<22 0> FEATURE 
366 <223> OTHER INFORMATION 
368 <400> SEQUENCE: 11 
3 70 Ala Pro Asp Val Gin As 
371 1 5 
3 74 Phe Phe Ser Gin Pro Gly Ala Pro lie 
375 20 25 

3 78 Phe Ser Arg Ala Tyr Cys Thr Pro Leu 
379 35 40 

3 82 Val Gin Lys Asn Val Thr Ser Glu Ser 
383 50 55 

3 86 Tyr Asn Arg Val Thr Val Met Gly Gly 
387 65 70 
3 90 Ala Cys His Cys Ser Thr Cys Tyr Tyr 

85 

<210> SEQ ID NO: 12 
LENGTH: 92 

396 <212> TYPE: PRT 

397 <213> ORGANISM: Artificial 
399 <220> FEATURE: 

<223> OTHER INFORMATION^ artificial 



391 
3 94 
395 <211> 



Cys Thr Leu Gin Glu Asn Pro 
10 15 
Leu Gin Cys Met Gly Cys Cys 
30 

Arg Ser Lys Lys Thr Met Leu 
45 

Thr Cys Cys Val Ala Lys Ser 
60 

Phe Lys Val Glu Asn His Thr 

75 80 
His Lys Ser 
90 



400 
402 



<400> SEQUENCE: 12 




Pro Gl 



404 Ala Pro Asp Val Gin 

405 1 5 

408 Phe Phe Ser Gin Pro Gly Ala Pro lie 

409 20 25 

412 Phe Ser Arg Ala Tyr Pro Cys Pro Leu 

413 35 40 

416 Val Gin Lys Asn Val Thr Ser Glu Ser 

417 50 55 

42 0 Tyr Asn Arg Val Thr Val Met Gly Gly 
421 65 70 

424 Ala Cys His Cys Ser Thr Cys Tyr Tyr 

425 85 
428 <210> SEQ ID NO: 13 




Leu Gin Glu Asn Pro 
10 15 
Leu Gin Cys Met Gly Cys Cys 
3 0 

Arg Ser Lys Lys Thr Met Leu 
45 

Thr Cys Cys Val Ala Lys Ser 
60 

Phe Lys Val Glu Asn His Thr 

75 80 
His Lys Ser 
90 
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RAW SEQUENCE LISTING ERROR SUMMARY 

PATENT APPLICATION: US/lO/7 97 , 553A 



DATE: 10/18/2004 
TIME: 14:23:14 



Input Set : A:\797, 553SeqList.txt 

Output Set: N:\CRF4\10182004\J797553A.raw 



Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:56; Xaa Pos . 1,8,10 

Invalid <213> Response: 

Use of "Artificial" only as H <213> Organism" response is incomplete, 

per 1.823(b) of New Sequence Rules. Valid response is Artificial Sequence. 

Seq#: 1,2, 3, 4 ,5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 ,17,18, 19,20,21,22,23,24,25,26,27 
Seq#: 28, 29, 30 ,31, 32 ,33, 34, 35,36,3 7,38,3 9,40,41,42,43,44,45,46,47,48,49,50, 51 
Seq#: 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66 
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VERIFICATION SUMMARY DATE: 10/18/2004 

PATENT APPLICATION: US/10/7 97 , 553A TIME: 14:23:14 

Input Set : A: \797 , 553SeqList ♦ txt 

Output Set: N:\CRF4\10182004\J797553A.raw 

L:2208 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:56 after pos . : 0 
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